Local Graph Clustering Beyond Cheeger's Inequality

نویسندگان

  • Zeyuan Allen Zhu
  • Silvio Lattanzi
  • Vahab Mirrokni
چکیده

Motivated by applications of large-scale graph clustering, we study random-walk-based local algorithms whose running times depend only on the size of the output cluster, rather than the entire graph. All previously known such algorithms guarantee an output conductance of Õ( √ φ(A)) when the target set A has conductance φ(A) ∈ [0, 1]. In this paper, we improve it to Õ ( min {√ φ(A), φ(A) √ Conn(A) }) , where the internal connectivity parameter Conn(A) ∈ [0, 1] is defined as the reciprocal of the mixing time of the random walk over the induced subgraph on A. For instance, using Conn(A) = Ω(λ(A)/ log n) where λ is the second eigenvalue of the Laplacian of the induced subgraph onA, our conductance guarantee can be as good as Õ(φ(A)/ √ λ(A)). This builds an interesting connection to the recent advance of the so-called improved Cheeger’s Inequality [KLL13], which says that global spectral algorithms can provide a conductance guarantee of O(φopt/ √ λ3) instead of O( √ φopt). In addition, we provide theoretical guarantee on the clustering accuracy (in terms of precision and recall) of the output set. We also prove that our analysis is tight, and perform empirical evaluation to support our theory on both synthetic and real data. It is worth noting that, our analysis outperforms prior work when the cluster is wellconnected. In fact, the better it is well-connected inside, the more significant improvement (both in terms of conductance and accuracy) we can obtain. Our results shed light on why in practice some random-walk-based algorithms perform better than its previous theory, and help guide future research about local clustering.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Cheeger's Inequality

In Ch], Cheeger proved the following general lower bound for the rst eigenvalue 1 of a closed Riemannian manifold: Theorem ((Ch]): 1 1 4 h 2 ; where h = inf N area(N) min(vol(A); vol(B)) where N runs over (possibly disconnected) hypersurfaces of M which divide M into two pieces A and B, and where area denotes (n ? 1)-dimensional volume, and vol denotes n-dimensional volume, where n = dim(M). h(...

متن کامل

Bounds on the L2 Spectrum for Markov Chains and Markov Processes: a Generalization of Cheeger's Inequality

We prove a general version of Cheeger's inequality for discretetime Markov chains and continuous-time Markovian jump processes, both reversible and nonreversible, with general state space. We also prove a version of Cheeger's inequality for Markov chains and processes with killing. As an application, we prove L2 exponential convergence to equilibrium for random walk with inward drift on a class...

متن کامل

Improved Cheeger's Inequality and Analysis of Local Graph Partitioning using Vertex Expansion and Expansion Profile

We prove two generalizations of the Cheeger’s inequality. The first generalization relates the second eigenvalue to the edge expansion and the vertex expansion of the graph G, λ2 = Ω(φ V (G) · φ(G)), where φ (G) denotes the robust vertex expansion of G and φ(G) denotes the edge expansion of G. The second generalization relates the second eigenvalue to the edge expansion and the expansion profil...

متن کامل

The Dirichlet Problem at Infinity for Random Walks on Graphs with a Strong Isoperimetric Inequality

We study the spatial behaviour of random walks on innnite graphs which are not necessarily invariant under some transitive group action and whose transition probabilities may have innnite range. We assume that the underlying graph G satis-es a strong isoperimetric inequality and that the transition operator P is strongly reversible, uniformly irreducible and satisses a uniform rst moment condit...

متن کامل

Lecture : Spectral Methods for Partitioning Graphs ( 2 of 2 )

Warning: these notes are still very rough. They provide more details on what we discussed in class, but there may still be some errors, incomplete/imprecise statements, etc. in them. Here, we will prove the easy direction and the hard direction of Cheeger's Inequality. Recall that what we want to show is that λ 2 2 ≤ φ(G) ≤ 2λ 2. For the easy direction, recall that what we want to prove is that...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013